Greedy Gaussian Segmentation of Multivariate Time Series

Authors

  • David Hallac
  • Peter Nystrup
  • Stephen Boyd
Abstract

We consider the problem of breaking a multivariate (vector) time series into segments over which the data is well explained as independent samples from a Gaussian distribution. We formulate this as a covariance-regularized maximum likelihood problem, which can be reduced to a combinatorial optimization problem of searching over the possible breakpoints, or segment boundaries. This problem is in general difficult to solve globally, so we propose an efficient heuristic method that approximately solves it, and always yields a locally optimal choice, in the sense that no change of any one breakpoint improves the objective. Our method, which we call greedy Gaussian segmentation (GGS), is quite efficient and easily scales to problems with vectors of dimension over 1000 and time series of arbitrary length. We discuss methods that can be used to validate such a model using data, and also to automatically choose appropriate values of the two hyperparameters in the method. Finally, we illustrate our GGS approach on financial time series and Wikipedia text data.
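
As a rough illustration of the breakpoint search described above, the following Python sketch scores a segment by its Gaussian log-likelihood under a shrunk covariance estimate and scans for the single breakpoint that most improves the total score. The abstract does not state the exact regularized objective, so the shrinkage form `S + (lam / m) * I`, the function names `segment_loglik` and `best_single_split`, and the parameter `lam` are all assumptions made for illustration; this is not the authors' implementation.

```python
import numpy as np


def segment_loglik(x, lam):
    """Gaussian log-likelihood of the rows of x under the segment's own
    mean and a shrunk covariance S + (lam / m) * I (assumed form)."""
    m, n = x.shape
    centered = x - x.mean(axis=0)
    emp_cov = centered.T @ centered / m
    cov = emp_cov + (lam / m) * np.eye(n)  # positive definite when lam > 0
    _, logdet = np.linalg.slogdet(cov)
    # Sum of squared Mahalanobis distances equals m * trace(cov^{-1} S).
    maha = m * np.trace(np.linalg.solve(cov, emp_cov))
    return -0.5 * (m * n * np.log(2.0 * np.pi) + m * logdet + maha)


def best_single_split(x, lam):
    """Return (t, gain) for the breakpoint t that most increases the total
    score when x is split into x[:t] and x[t:]; t is None if no split helps."""
    total = x.shape[0]
    base = segment_loglik(x, lam)
    best_t, best_gain = None, 0.0
    for t in range(1, total):
        gain = segment_loglik(x[:t], lam) + segment_loglik(x[t:], lam) - base
        if gain > best_gain:
            best_t, best_gain = t, gain
    return best_t, best_gain
```

Applying `best_single_split` repeatedly to the current segments, and then re-checking each existing breakpoint one at a time, is one way to approach the kind of local optimality the abstract mentions, where no change of a single breakpoint improves the objective.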

Similar resources

Segmentation analysis on a multivariate time series of the foreign exchange rates

This study considers the multivariate segmentation procedure under the assumption of the multivariate Gaussian mixture. Jensen-Shannon divergence between two multivariate Gaussian distributions is employed as a discriminator and a recursive segmentation procedure is proposed. The daily log-return time series for 30 currency pairs consisting of 12 currencies for the last decade (January 3, 2001 ...

A Switched Gaussian Process for Estimating Disparity and Segmentation in Binocular Stereo

This paper describes a Gaussian process framework for inferring pixel-wise disparity and bi-layer segmentation of a scene given a stereo pair of images. The Gaussian process covariance is parameterized by a foreground-background-occlusion segmentation label to model both smooth regions and discontinuities. As such, we call our model a switched Gaussian process. We propose a greedy incremental al...

Energy Minimization by α-erosion for Supervised Texture Segmentation

In this paper we improve image segmentation based on texture properties. The already good results achieved using learned dictionaries and Gaussian smoothing are improved by minimizing an energy function that has the form of a Potts model. The proposed α-erosion method is a greedy method that essentially relabels the pixels one by one and is computationally very fast. It can be used in addition ...

Greedy decomposition integrals

In this contribution we define a new class of non-linear integrals based on decomposition integrals. These integrals are motivated by greediness of many real-life situations. Another view on this new class of integrals is that it is a generalization of both the Shilkret and PAN integrals. Moreover, it can be seen as an iterated Shilkret integral. Also, an example in time-series analysis is prov...

ESTIMATING SETARs WITH MULTIVARIATE THRESHOLDS

GRASP is a Greedy Randomised Adaptive Sampling Procedure that has been proposed to estimate parameters of self-exciting autoregressive threshold models (SETARs) with multivariate thresholds. We show that the GRASP procedure can often lead to an incorrect number of thresholds when estimating SETARs. Two simple modifications of the original GRASP procedure are suggested to overcome this problem. ...

Publication date: 2016